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Abstract: This note introduces the concept of a partially specified 

prior distribution for certain post hoc inference problems, where a fi- 

rr^ , nite population is sampled once in order to make a decision on the 

r^ . presence or complete absence of some attribute. If the decision is made 

Cy ■ to accept complete absence, a probability statement may be required 

that the population is indeed free of the attribute. A partially specified 

prior is shown to be advantageous in making such statements realistic 

^ , and useful. 
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1. Introduction 

Consider a finite population of size N in which A of the elements posses a 

certain characteristic or attribute. Often the desired value for A is zero, and 

the decision problem is {A = 0} versus {A > 0}. For example, certification 

may be required that a population of animals is free from an infectious 

disease, as against having one or more infected animals. Alternatively, in a 

quality inspection setting, we may require {A < k} where k <ti N. 
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Suppose that a simple random sample of size n without replacement is 
drawn, from which a of the sampled elements are found to possess the at- 
tribute. When a > 1, the inference on whether j4 = or not is obvious. 
When a = 0, a decision may be made to accept the hypothesis that ^ = 0, 
or ^ < A;. The consequent issue after making this decision is to ascertain 
the probability that A = {} or A < k. One indirect way of answering this 
question is to construct confidence or credible limits for A; see, for exam- 
ple, see [7| and 3| . The problem of constructing a uniformly most accurate 
100 (1 — a) % upper confidence bound for A is addressed in 9|. It was also 
remarked in [9[ that if one is to be 99% "confident" that the population 
is free from the set attribute, then the sampling fraction n/N must be at 
least 99%. The use of Bayesian methods was suggested by [4]. It was also 
commented that Pr {A = 0|a = 0} is of practical interest. 

The rest of this note is organised as follows. In Section 2, we review the 
results given in 2] and discuss the determination of Pr {A = 0|a = 0}. In sec- 
tion 3, the concept of partially specified prior is introduced and illustrated. 
The final section provides the conclusions. 

2. Determination of Pr {A = 0|a = 0} 

For a given discrete prior distribution p(^) on the support Z^+i = {0, 1, 2..., N}^ 
the following solution is provided in 4]): 
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Pr{^ = 0|a = 0} 
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In the case of a uniform prior p (A) = 1/ {N + 1), see 



(2.1) 



Pr {A = 0\a = 0} = (n + 1) / (iV + 1) 



(2.2) 



Equation (j2.2p suggests that the saniphng fraction n/N must be very large 
to achieve a large value for Pr {A = 0\a = 0} such as 0.95 or 0.99. This is a 
consequence of the employment of a "non-informative" prior distribution. 

A Bayesian approach for the determination of credible intervals for the 
fraction nonconforming, using a uniform prior with an assumed upper bound 
M was discussed in [J]. Using their approach we obtain 



Pr{A = 0|a = 0} 



or equivalently 



N 
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Pr{y4 = 0|a = 0} 
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Again the sampling fraction n/N must be very large to achieve a large 
posterior probability, which leads to the consideration of other, perhaps more 
realistic, priors. 

n 

The mixed binomial prior distribution was originally proposed by |1| in 
a quality control context. This prior was later generalised in [8]. The ba- 
sic mixed binomial prior distribution given in [1] consists of two binomials 
which are mixed with known probabilities of mixing. This prior, originally 
suggested for lot by lot sampling inspection, is of the form 

(aj,pj), (i = 1,2) with ai + 02 = 1 

where Ui is the probability with which a lot has a proportion pi non- 
conforming. The basic idea behind the mixed binomial prior is that lots 
are either of acceptable quality pi, under common cause variations in the 
production process, or of unacceptable quality p2 (> Pi) in the presence of 
special causes. One advantage of the mixed binomial prior is that it takes 
the structure of the inspection problem into account. The main purpose 
of sampling inspection is to dispose the lot under consideration either as 
acceptable or unacceptable. The mixed prior distribution represents the two 
subgroups, "good" and "bad", of the superpopulation (production process) 
from which the lot under consideration is formed. 

A practical problem with the mixed binomial prior is that it is difficult 
to specify in full. Whilst some knowledge of the characteristics of common 
cause variation might be expected, special causes by their very nature are 
unpredictable and unquantifiable. In the epidemiological context, some in- 
formation may be forthcoming about background rates of infection, but not 
about sporadic outbreaks of disease. There may in fact be many kinds of 
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"bad" lot, requiring a prior with many components. We note however that 
full specification of the prior is not necessary if one considers the two-stage 
nature of the inferential procedure. First a decision is made, based on an 
explicit acceptance rule, as to whether the lot at hand is "good" or "bad". 
Subsequently, a probability statement on the number of non-conformances 
is only required when a lot is accepted as "good". At this point we can 
dispense with those components of the prior that reference "bad" lots, and 
work only with the component that references a "good" lot. Thus the prior 
only needs to be specified for the "good" lots. 

This differs from the usual Bayesian inference in that, instead of updating 
the prior based on the data to give p(A|Data), we are using the data to make 
a decision and then updating the prior based on the decision, p(A|Decision). 
We define the term "partially specified prior" or PSP as the required com- 
ponent of the full prior specified for post hoc inference after a decision on the 
population of interest is made. We do not advocate that such partial spec- 
ification is possible or advisable for all inference problems. This approach 
is suitable only for problems whose prior distributions can be thought to 
be mixtures of two or more individual distributions, and where an explicit 
decision rule has been agreed a priori. 

Strictly speaking, the term "prior" refers to the specification of the full 
distribution of the parameters before obtaining data. We suggest that the 
use of a PSP can be interpreted as similar to the adjusting the belief struc- 
tures discussed in |0]. The lack of general methods for dealing with partial 
specification of probability structures is also highlighted in g] . 
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3. Partially specified prior for Pr {A = 0|a = 0} 

For decision problems on a finite population, a decision is first made based on 
the sample data. For example, a batch of items may be accepted if the sam- 
pled items are all conforming. The sample size for such acceptance sampling 
problems is not determined purely based on statistical grounds (Bayesian or 
otherwise). Cost and other industry standards or mutual contracts between 
the producer and the consumer dictate such sample sizes. If the lot is ac- 
cepted after the observance of a = 0, then evidently < Pr {A = 0} < 1 for 
n < N . The relevant post hoc inference question is what is Pr {A = 0} or 
perhaps Pr{A < k} where A; is a small number. To answer such questions, 
consideration of an appropriate prior can be related to the structure of the 
problem, namely that the hypothesis A = will be accepted under the given 
sampling plan if a = is observed. Since a post hoc probability statement 
is only required following acceptance of a lot, we can ignore those compo- 
nents of the prior related to bad quality lots, and work only with a partial 
specification of the prior information. We assume that some prior knowl- 
edge of belief concerning the good lots is available, perhaps in the form of a 
BinomialfA^, 5) comprising that part of the mixed binomial prior pertaining 

n 

to a good lot given in [l]. Note that the structure of the problem in hand is 
important to adopt a mixture prior and then separate out the relevant part 
of it to form the partially specified prior for post hoc inference. 

Even though we focus largely on the case a = in this note, the suggested 
binomial partially specified prior is a conjugate prior for all values of a since 
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PiA\a) 

oc P{a\A)pps{A) 

°^ (A-a)\{N-A-n+ay. A\{N-A)\" V^ ") ^ ' 

(N — n\ 
A-a I 
for A = a, a+l,...,A^, implying that the posterior distribution oi A — a 
is Binomial(A^ — n, 5). 

The parameter 5 for the partiahy specified prior ppg{A) can be ehcited 
as the expected proportion of elements having the attribute in a good lot. 
Note that: 



N-n 



P{a = 0) = ^ P(a = 0) X ppsi 

N-A \ I N \ \ IN 




A 



(3.2) 
Clearly (1 — 5)" is the probability that all n items are free from the 

attribute interest in which 6 is the probability of an element having the 

attribute in a very large population. 
From equation (j3.ip . we obtain 



Pr{^ = 0|a = 0} = (l-<5)''-" (3.3) 
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As an example, consider the international Standard ISO 2859-1:1999 J5| 
normal inspection plan for the lot size N = 3200 and Acceptance Quality 
Limit {AQL) of 0.10%. The sampling plan n = 125 and Ac = is listed 
in the ISO Standard corresponding to the desired AQL and lot size. After 
acceptance of the lot, the belief of the consumer improves. The term AQL 
is the maximum percent nonconforming that is considered satisfactory as 
a process average p. After acceptance of the lot, the improved consumer's 
belief is that p < AQL. Hence a plausible value for 5 is ^.bAQL. Based on 
this value for 5, we find from equation (j3.3p that PrjA = 0|a = 0} = 0.215 
under the plan n = 125 and Ac = 0. his figure is much larger than 0.0394 
found using the uniform prior (equation (j2.2|) ). 

Finally we examine briefiy the case of a non-zero acceptance number, 
where a lot is accepted on observance of a < Ac where Ac > 0. Equation 
(j3.ip can be used to calculate the probability that the lot is of acceptable 
quality after acceptance of the lot namely: 

fc-a / TV - n \ 
P(A < k\a) = ^ ^^""(1 - 5)^-^-"+'^ (3.4) 

A=o \ A- a J 

where k = int (NAQL). For example, consider the normal inspection plan 

n = 200and Ac = 2 corresponding to A^ = 10,000 and AQL = 0.40% from 

the ISO 2859-1:1999 Standard. For given a = 1, k = 10000 x 0.004 = 40, 

and (5 = 0.002, equation ([331) yields 99.9%. 
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4. Concluding Remarks 

This paper introduces the concept of partially specified prior for post hoc 
statistical inference issues such as making probability statements about a 
finite population of interest after certain decisions are made concerning the 
population. The suggested partial specification of the prior is useful for qual- 
ity control and other applications. 
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